Method summary (v2.0)  
  
Design model  
Each record is encoded as a spatial maturation chain:  
targeting_route -> folding_module -> protease_or_processing_enzyme -> post_processing_destination.  
  
Inclusion criteria  
1) Mechanistically clear relevance to archaeal protein maturation.  
2) Explicit or inferable processing location (cytosol, membrane, surface).  
3) Cleavage polarity determinable as N-terminus, C-terminus, Internal, or None.  
4) Evidence source classifiable as Review or Primary.  
  
Curation workflow  
1. Select substrate/process class from archaeal protein biogenesis domains.  
2. Assign targeting route (Cytosolic, Sec, Tat, Sec-like, or composite where needed).  
3. Assign dominant folding module (e.g., thermosome-centered cytosolic folding).  
4. Assign processing enzyme class (SPI, prepilin peptidase-like, ArtA family, or None).  
5. Encode cleavage polarity and processing site context.  
6. Assign post-processing destination and cellular zone.  
7. Add readout signature suitable for experimental interpretation.  
8. Assign evidence_level, confidence_tier, and curation_status.  
  
Quality control  
- Unique record_id values.  
- Required fields populated.  
- Controlled-value fields normalized to dictionary values.  
- Sentence-level consistency in descriptive fields.  
- Ambiguous entries labeled provisional.  
- No patient-level or identifiable data.  
  
Scope  
This dataset is a structured synthesis for educational and research use.  
It is not a clinical decision resource.  
